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@ Compression and expansion of digitized voice signals. 

@ Apparatus and a method are disclosed for digitizing a 
voice signal as it occurs and the digitized signal is compress- 
ed and stored for subsequent reconversion to a voice signal 
at a latter time. Binary words derived from digitizing the 
signal in an analog-to-digital converter represent the voice 
non lineary and are first linearized by a first processor and 
then stored in a first memory until a block of binary words 
are assembled. Next the block of binary words are transfer- 
red to a second buffer memory and a second processor 
compresses it using time domain harmonic scaling proces- 
sing which extrarts the pitch of the original voice and also 
compresses the voice frequency spectrum to affect a 2:1 
reduction in the number of stored binary words representing 
the digitized voice signal. This reduced amout of data is then 
upsampled by a factor of four using linear interpolation to 
derive a pseudo higher sample rate required for further 
processing using continuously variable slope delta modula- 
tion processing which yields one binary bit out for every 



binary word processed. The resultant bits are assembled into 
eight bit words and then stored in a third memory. Another 
binary word representing the pitch is added and the result is 
a 4:1 compression of the original digitized data. The 
compressed digitized data is stored in a bulk memory. 
Thereafter, more digitized samples of the voice signal 
undergo this same processing. Compressed digitized voice 
signals may be reconverted to a voice signal for playback by 
processing the digitized and compressed voice signals 
generally in the reverse order. Playback speed control may 
also be accomplished without changing the pitch of the 
voice. To double the playback speed the time domain 
harmonic scaling processing is deleted in playback, thus 
leaving the frequency spectrum compressed and a 2:1 
reduction in binary words representing the digitized voice. 
To slow the playback speed the digitized voice signals 
undergo modified TDHS processing. 
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Wang Laboratories, Inc., Lowell, MA, USA EP-57 570 

COMPRESSION AND EXPANSION OF DIGITIZED VOICE SIGNALS 

5 

This invention generally relates to the digitization of continuously 
varying signals which are subsequently reconverted to continuously varying 
signals, hfore particularly, this invention relates to coTnpression of 
digitized analog signals and subsequent expansion of the compressed 
10 digitized analog signals for reconversion to analog signals. 

In the telecommunications field there has been a continuing trend 
touBrd the use of digital transmission techniques wherein -analog signals, 
particularly voice signals, are digitized, transmitted and then 

15 reconverted to a sufficiently accurate representation of the original 

voice signal to be delivered at the receiving end of a telecommunications 
link. Such digitization of voice signals has permitted multiplexing a 
large number of voice band channels together, and the switching of those 
multiplexed voice band channels can then be performed economically using 

20 digital techniques. Presently, these voice digitization techniques have 
been applied primarily to long distance transmission wherein transmission 
costs dominate and significant savings are realized by utilizing digital 
transmisssion techniques of digitized voice signals. As the size and cost 
of microprocessors and memory keep decreasing it is becoming economically 

25 feasible to extend these digital techniques for processing analog signals 
into the terminal equipment at both ends of a telecommunications link. 
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Within ten to fifteen years it is estimated that digital transmission 
techniques and equipment will completely dominate the telecommunications 
field from the subscriber terminal equipment through switching equipment 
to the -long distance transmission equipment. 

To further extend the usefulness of digital techniques in the 
telecommunications field much work is being done to reduce the number of 
bits in the digital signal representing a speech signal without impairing 
reproduction of the voice signal that the digital signal was originally 

10 derived from. This is done by signal processing of the continuously 

varying voice signals and the digital signals derived therefran to achieve 
the bit-rate reductions. Different techniques have been developed for the 
bit -rate reduction of digitized voice signals which are generally referred 
to as waveform coding techniques. These waveform coding techniques 

15 include adaptive differential pulse code modulation, sub -band coding and 
transform coding,- all of which can typically reduce the bit-rate of a 
digitized voice signal by more than a factor of two. Another technique 
used in the Bell System is time assignment speech interpolation (TASI] in 
which silence intervals within speech are detected and not transmitted, 

20 Yet another technique is vocoding in which speech is analyzed to extract 
its essential parameters followed by synthesis to reconstruct the speech. 
Presently, vocoder techniques do not offer the naturalness of spealcer 
characteristics and its use is limited to applications where extremely low 
bit rates are dictated, such as in secure voice transmission. 

25 

Speech processing and digitization to reduce the number of binary 
bits making up the digitized voice signal are also being applied to a 
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relatively new field wherein digitized voice signals are stored and then 
■read out for reconstruction into voice signals at a later time. This is a 
store -and-forward approach wherein voice messages are digitally stored for 
later delivery to a receiving party via the telephone, similar to the 
5 service provided by electronic mail, Store-and-forward systems have been 
developed by a number of companies including Wang, IBM, Bell System and 
™, Inc. 

In the store-and-forward systems large capacity memories are required 
10 and presently relatively expensive disk storage is utilized. Without 
bit-rate reduction techniques being applied to a digitized voice signal, 
speech may typically be digitized at 64 kilobits /second and every two 
minutes worth of digitized speech requires one megabyte of memory space 
for storage. It can readily be seen that extensive amounts of memory are 
15 required for a viable store-and-forward system and there is a need for 

techniques to reduce the amount of memory required to store a given amount 
of digitized voice. One approach has been to store only the active speech 
but this provides only a small amount of signal compression. The 
previously mentioned adaptive differential pulse code modulation, sub-band 
20 coding and transform coding techniques can compress the digitized signals 
by a factor of two which is a significant savings but there is still a 
need in the art to further compress digitized voice signals to minimize 
the amount of memory required to store these signals. 
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The above need in the prior art is met by the invention as 

defined in the appended claims which' are • ihcorporated herein by reference, 
viiich - 

utilizes prior art bit rate compression techniques to achieve a 4:1 
5 reduction in the binary signals representing a digitized voice signal. 
The preferred embodiment of the invention utilizes the Time Domain 
Harmonic Scaling technique (TDHS) which is used by itself in the prior art 
to achieve a 2:1 reduction of the binary data representing a digitized 
voice signal. In addition, we utilize the Continuously Variable Slope 
10 Delta modulation technique (CVSDl, which is also normally used by itself 
to achieve bit rate reductions. TE^ processing comprises Time Domain 
Pfermonic Compression (TDHC) processing for data compression and Time 
Domain Harmonic Expansion ClDHE) processing for data expansion. In 
addition, a known technique of converting an eight bit output from a 
15 non -linear analog to digital converter to a twelve bit number which 

linearizes the output from the analog to digital converter is also used. 
Further, a linear interpolation upsampling technique is utilized to 
increase the data sampling rate required for operation of Continuously 
Variable Slope Delta modulation. 

20 

Initially a voice, audio or other analog signal that continuously 
varies is sampled eight thousand times per second and each sample is 
linearly converted to a binary number. This is accomplished by first 
passing the voice signal through a conventional analog-to-digital 
25 converter which provides an eight bit binary nunber output in u-25S LAls 
PCM form for each sample. The analog -to-digital converter inherently is 
non-linear in nature so the digitized voice signal output therefron is 
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processed further utilizing a first microprocessor to remove the effects 
of the non-linear conversion hy the analog-to-digital converter. This 
linearization of the digitized voice converts each eight bit binary number 
to a twelve bit binaiy number and a first number of these twelve bit 
5 binary nunbers, representing digitized samples of the audio signal, are 
collected in a first buffer memory. This is done in real time. Vstien the 
first buffer memory has a predetermined number of binary numbers stored 
therein, representing a given time segment of speech, the binary words are 
transferred to a second buffer memory associated with a second high speed 

10 processor. This second processor implements the pitch detection of the 
TEHC technique, which uses autocorrelation to extract the pitch value of 
the voice signal being input to our novel apparatus. In addition, using 
triangular weighting, TDHC accomplishes a bit rate reduction with the 
first number of binary numbers representing the sampled voice signal being 

15 halved. This is a 2:1 bit rate reduction yielding four thousand samples 
per second. Further in the signal processing OJSD processing is performed 
which normally requires a sampling rate input of at least thirty-two 
thousand sainples per second in order to function properly. The output 
from the T1>1C -processing does not provide a high enoi^gh sample tare input 

20 for the proper operation of the C\'SD processing. We therefore upsarnple 
the output of the lYHC processing by a factor of four to effectively 
provide an output of sixteen thousand samples per second. This is 
accomplished using an interpolation technique, well known in the art, to 
generate binary numbers between the binary numbers output from the TDHC 

25 processing. The result of the upsampling is sixteen thousand binary 

nixTiber samples per second, but due to the audio spectrum being compressed 
by a 2:1 factor by the TDHC processing, the CVSD processing operates as if 
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it has a digitized audio signal at the rate of thirty-two thousand samples 
per second applied thereto. The CV5D processing provides one output bit 
for every binary word input thereto. The individual bits output from the 
CVSD processing are assembled into binary words by a serial/parallel 
5 converter. When a complete binary word has been assCTibled in the 

serial /parallel converter it is transferred out of the converter to a bulk 
memory. In this manner, each of the twelve bit binary nunbers stored in 
the first buffer xnemory is converted to a binary bit in one of the binary 
numbers placed in the bullc 'memory. Also stored in the bulk memory with 

10 the binary ntHnbers assembled in the converter is the value of the pitch 
period in binary nunber form, as determined during TDHC processing. Due 
to the operation of the invention, cne achieves a4:l bit rate reduction of 
the original digitized voice signal. Thus less memory is required to 
store the digitized and compressed voice signals than heretofore possible 

15 in the prior art. 

To expand the digitized and compressed version of a voice signal the 
contents of the bulk memory are read out one word at a time and input to 
the first buffer memory via a EM^ interface under control of the first 

20 miroprocessor- The data is then moved from the first buffer memory to the 
aforementioned serial /parallel converter which is now operated to provide 
parallel to serial conversion. The CVSD processing is now operated in an 
inverse mode, as is known in the art, and takes each bit of the 

binary words output from the parallel to serial conversion and reconverts 

2S them to a binary msnber. The binary ninnbers output from the inverse OfSD 
processing are now downsampled to remove the interspersed binary nunbers 
created by interpolation during upsampling. The binary nijnbers remaining 
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after the 4:1 downsampling now undergo the Time Domain Harmonic Expansion 
(TEHE) processing. The TDHE processing utilizes the digitized value of 
pitch which has also been read out from the bulV memory and along with the 
downsainpled binary signals recreates the linearized first number of binary 
5 words representing the voice signal which were originally processed by 
TEHC processing. The binary words generated by the TEHE processing are 
then delinearized back to u-255 L^^ PCM in the second processor before 
being applied to the converter which is now operating in a 
digital-to-analog mode. The output of the converter is an accurate 
10 representation of the voice signal initially digitized, compressed and 
stored utilizing the teaching of our invention. 

Another feature of the invention is the ability to provide speed 
control when digitized and compressed voice signals are being reconverted 

15 to voice signals, Vi'hen it is desired to double the playback speed the 
binary words resulting from the downsampling do not undergo TDHE 
processing to double the number of binary words, but rather are 
delinearized and sent directly to the digital-to-analog converter to be 
reconverted to a voice signal. ^Vhen it is desired to slow the playback 

20 speed, TCHE processing is done a little different than when playing speech 
back at normal speed. 



25 invention will be better understood upon reading the following 

Detailed Description in conjunction with the drawing in which 
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Figure 1 is a detailed block diagram of the hardware elements 
implementing the invention; 

Fignre 2 is a block diagram showing the signal processing steps 
5 being applied to digitize and cschjidsss voice signals; and 

Figure 3 shows a block diagram showing the signal processing steps 
for reconverting digitized and compressed voice signals back into the 
voice signal. 

10 

In Figure 1 is shown the block diagram of the preferred embodiment of 
our invention which generally comprising a programmable signal processor 9 

15 functioning with a host processor 7 which controls the storage or 

transmission of digitized and compressed voice output from processor 9. 
fbst processor 7 can be any coniradri' processor configuration and herein 
comprises control input 22, host microprocessor 23, program memory 24, 
output channel 25, input channel 26, and storage msnory 34 which are all 

20 interconnected by and function over an eight bit bus 27. Control input 22 
comprises any type of control input to the system, such as keys, switches 
or a keyboard. These inputs are used to activate processor 9 and to 
select the playback speed when digital signals are reconverted to voice 
signals. Program memory 24 contains the program which controls the 

25 operation of the host microprocessor 23, Output channel 25 is a path over 
which voice signals that have been digitized and compressed may be output 
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to external meniory devices or to a data communication channel [both not 
shown]. Similarly, input channel 26 may be used to receive digitized and 
compressed voice signals from an external memory or from a data 
communication channel [both not shown] to either be stored in memory 34 or 
5 to be converted back to a voice signal by processor 9. Memory 34 is a 
bulk memory of large capacity used to store the digitized and compressed 
voice signals generated by processor 9 or received from an external source 
via input channel 26. Host processor 7 communicates with processor 9 via 
control and status circuit 36 and the eight wires designated 28 which 

10 collectively tie buss 27 of processor 7 to a buss 19 processor 9. EMA 
interface circuit 30 and serial /parallel converter 31 cooperate and are 
controlled to transfer digitized and compressed voice signals between host 
processor 7 and processor 9. The operation of these circuits is described 
in detail further in this specification. Processor 9 is made up of two 

15 basic sections. The first section comprises a converter 11 which can 
function as either an analog-to-digital converter or a digital-to-analog 
converter, buffer memory 12 and input /output processor 13. Input /output 
processor 13 is an Intel 8089 I/O controller which controls circuits 11 
and 12 per a program stored in program memory 18 to perform the basic 

20 digitization of voice signals input to the voice digitizer from analog 
input 10. Converter 11 is a National 3054 Codec/Filter Combination Chip. 
Analog input 10 may be a microphone or any other source of voice or any 
other continuously varying signals. Similarly, input /output processor 13 
controls circuits 11 and 12 when the processor 9 is in a playback mode to 

25 reconvert the digital signals to voice signals to be applied to analog 
output circuit 33 which may be a speaker, tape recorder or other device. 
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The other portion of processor 9 is microprocessor 14 which is the Texas 
Instruments IMS-SZO processor. Processor 14 has an internal data memory 

15 which serves as a buffer iDonory which is controlled by microprocessor 

16 via' bus 21. Multiplier circuit 17 is a 16 X 16 parallel multiplier 

5 with a 32 bit product which performs the basic high speed number crunching 
required for real time binary number corapression. Qrcuits 15, 16 and 17 
in TMS-SZO processor 14 coriHnunicate via a bus 20. The programs 
controlling microprocessor 16 and I/O processor 14 are both stored in 
program memory 18. Processor 14, program memory 18 and the basic 
10 digitizing circuits 11, 12 and 13 are all tied to and communicate over a 
sixteen bit bus 19- 

In basic operation a voice signal input via analog input circuit 10 
is sampled and converted to a multiplicity of digital words in a manner 

15 known in the art by circuits 11, 12 and 13 whose operation is described in 
greater detail hereinafter. These digitized voice signals are then 
forwarded to processor 14 via bus 19. Processor 14 signal processes the 
digitized voice signals to achieve the 4:1 data compression that is 
achieved using the teaching of the invention. The digitized and 

20 compressed voice signals output from processor 14 are output one bit at a 
time and are assembled into eight bit binary words by a serial -to-parallel 
converter 31. When serial -to-parallel converter 31 is full its contents 
are shifted in parallel format via IMA. interface circuit 30 and host 
processor bus 27 to either bulk memory 34 or via output 25 to either an 

25 external memory or to a data comraunications link. In playback mode 

digitized and ccnipressed voice signals are obtained from memory 34 or are 
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input from an external source via input 26 and are inter imly stored in 
buffer memory 12, then forwarded to converter 31 which is now operated as 
a parallel -to-serial converter to feed the data bit by bit via bus 19 to 
processor 14. Processor 14 processes the bit information input thereto in 
5 playback mode to decompress the data, then delinearizes this data to u-255 

LAW PCM form, and finally forwards the delinearized data directly to 
. circuit 11 which now operates as a digital-to-analog converter which 
converts the binar)-- words to a voice signal applied to analog output 33. 
All these circuits just briefly outlined hereinabove operate in real time 

10 and digitize and compress voice as it is input to programmable signal 
processor 9 via analog input 10. As previously mentioned, in playback 
mode an operator applying an instruction via control input 22 may effect 
the operation of microprocessor 14 such that a voice signal output at 
analog output 53 may be speeded up or slowed down but with the same voice 

15 pitch. 

With this basic understanding of the operation of host processor 7 
and processor 9 I now describe the circuits shown in Figure 1 in greater 
detail. 

20 

A voice signal input to processor 9 via analog input 10 is applied to 
the National converter identified above that can either function as an 
analog-to-digital converter or a digital-to-analog converter. It would be 
obvious to one skilled in the art that two separate converter circuits may 
25 be also utilized for these functions. In the preferred embodiment of the 
invention converter 11 digitizes voice signals by sainpling a voice signal 
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and digitizing the results in eight bit form at a rate of eight thousand 
times per second. The output from converter 11 is the known u-255 LMV FQA 
standard. As is also known this output has some inherent inaccuracy due 
to the- non-linear nature of the sampling and digitization process. To 
5 overcome this inherent inaccuracy a more expensive twelve bit digitizer 
may he utilized, but besides being more expensive such a digitizer also 
requires more space. To save both space and money v/e have utilized 
digitizer 11 and then used input/output processor 13 to process each eight 
bit word output fron digitizer 11, in a manner well known in the art, to 
10 generate a twelve bit binary word that includes correction for the 

non-linearity of digitizer 11. Hiis eight bit word to twelve bit word 
conversion is accomplished using a look up table in a manner well known in 
the art. 

15 As each binary word output from digitizer 11 is corrected for 

non-linearity by processor 13, approximately two hundred such twelve bit 
words are collected in a first part of buffer memory 12 which can hold a 
total of two-thousand such words. The actual block size is dependent i^n 
the pitch range of interest and can vary. When first buffer memory 12 has 

20 approximately two hundred words stored therein processor 13 causes the 
words to be forwarded via busses 19 and 20 to be stored in second buffer 
data memory 15 in 'IMS-520 processor 14. The next two hundred binary words 
stored in buffer memory 12 are stored in another portion of the memory so 
as not to interfere with transfer of the previous two hundred binary words 

25 to memory 15 in processor 14. 
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The two hundred binary words resulting from digitization of the input 
voice signal that have been transferred to data memory 15 are first signal 
processed by the Time Domain Harmonic Compression (TDHC). This known 
technique is not described in great detail herein but is described in 
5 detail in the following cited articles which are incorporated by reference 
herein. These references are: 

1, "Time Domain Algorithms for Harmonic Bandwidth Reduction and 
Time Scaling of Speech Signals", by D. Malah, IEEE Transactions 

10 Acoustics, Speech and Signal Processing , Volume ASSP-27, No. 2, 

PP. 121-133, April 1979; 

2. "Combined Time Domain Harmonic Compression and OJS'D for 7;2 k 
bit/s Transmission of Speech Signals", by D. Malah, Proceedings 

15 of IEEE ICA5SP , Denver, Colorado, 1980, PP. 504-507, April, 1980; 

3. '?tew ^proach to Speed Digitization Combining Time Domain 
Harmonic Scaling Adapative Residual Coding", by J. L. Nfelsa and 
A. K. Pande, Final Report on DCA 100-80-C-0050 , August, 1981; and 

20 

4, "On the Use of Autocorrelation Analysis for Pitch Detection", by 
L. R. Rabiner, IEEE Transactions on Acoustics, Speech and Signal 
Processing , Volume ASSP-25, 1, PP. 24-33, Feb. 1977. 

25 This Time Domain Harmonic Compression [TDHC] processing technique 

looks at the digitized voice signals to locate the peak of the auto 

1 7 
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correlation of the wave forms. Frora this information the pitch period of 
the voice is derived and, in addition, two pitch periods of digitized 
voice information are averaged together utilizing a triangular weighting 
function into a coinpressed set of data yielding a 2:1 data reduction. In 
5 the prior art, this TTHC processing is all the signal processing that 
would be done to a digitized voice signal to compress the data. However, 
in accordance with the teaching of our invention we further process the 
already compressed data to further compress same. 

10 The next major signal processing done on the already 2:1 compressed 

data is with the laiown Continuously Variable Slope Delta Nfodulation 
processing technique. Normally, for this technique to function properly 
the sample rate input thereto must be at least thirty-two thousand samples 
per second. Following the TDHC processing we now only have a sampling 

15 rate of four thousand samples per second which is insufficient for proper 
Q^D processing. To circunvent this problem the partially compressed data 
obtained utilizing TEHC processing upsampled by a factor of four utilizing 
a known interpolation technique to yield a total sample rate of sixteen 
thousand samples per second. This is normally insufficient for prc^r 

20 O.SD processing but the prior TDHC processing has compressed the frequency 
spectrum of the digitized voice by a factor of two so the CV5D processing, 
while only receiving a sample rate of sixteen thousand samples per second, 
produces the same results as if it is operating with a sample rate of 
thirty-two thousand samples per second- This CVSD processing is known in 

25 the art so is not disclosed in great detail herein but is described in 
great detail in the following listed references which are incorporated by 
reference herein. These references are; 



01 1 SJ-71 



1. 



"I 



'Continuous Delta Modulation", J. A. Greefkes, Phillips Res. 



RfiDt., No. 23, PP. 233-246, 1968; 



2.. ' "Delta Modulation Systems ", R. Steele, Halsted Press, London, 



5 



1975; and 



3. 



"Specifications Nfanual for MC3418, Continuously Variable Slope 
Delta Nfodulator /Demodulator ", Motorola Semiconductors, Phoenix, 



Arizona. 



10 



The 1:4 upsainpling and 4:1 downsampling mentioned hereinafter is not 
described in detail herein but is described in detail in "A General 
Program to Perform Sampling Rate Conversion of Data by Rational Ratios", 
by R, E, Crochiere of Bell Laboratories in >furray Hill, N.J. , Prograips For 
15 Digital Signal Processing , IEEE Press, 1979, pages 8.2-1 to 8.2-7. 

The CVSD signal processing yields one binary bit output for each 
binary word input thereto. Thus, for the two hundred binary words stored 
in data memory 15 for a small segment of speech, a total of two hundred 

20 binary bits are out from the CVST) processing. Each of these bits is 

output via busses 20 and 19 to be temporarily stored in serial -to-parallel 
converter 51 which assembles the bits into eight bit binary words. Thus, 
the two hundred bits that are output from the CVSD processing are combined 
into twenty five eight bit binary words by converter 31. Host 

25 microprocessor 23 is informed of the status of processing of the two 
hundred binary words stored in data memory 15 of processor 14. Host 
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microprocessor 23 causes each eight bit binary word assembled in 
serial -to-parallel converter 31 to be transferred therefrom via EMA 
interface circuit 30 to be either stored in bulk memory 34 or to be output 
via oiitput 25 to an external memory or to a telecommunications channel. 
5 In addition, the pitch period determined by the TDHC processing and in 
binary word form is added with the associated twenty five compressed 
binary words stored in bulk memory 34, 

l^on the two hundred voice samples in binary word form stored in data 
10 memory 15 being processed in the manner just described, processor 14 is 
now ready to receive another two hundred binary words representing 
digitized voice for further processing. At this time another two hundred 
voice samples in binary word form has been assanbled in buffer memory 12 
and these binary words are transferred via busses 19 and 20 to data memory 
15 15 which now processes this new information in the manner just described. 
Thus, voice digitizer 9 operates in real time to digitize and compress . . 
voice signals input via analog input 10, 

Figure 2 shows the functional steps performed by the hardware shown 
20 in Figwe 1 and described hereinabove. A voice signal first undergoes 

sampling and digitization to the U-2S5 LAW PCM standard in block 35. Next 
the digitized signal is linearized as shown at block 36. Then the 
digitized and linearized voice signal undergoes signal processing to 
derive the pitch period of the audio signal and to compress the digitized 
25 samples by a factor of 2:1 as represented by block 37. Next the partially 
compressed digital signals output from the TDHC processing are upsampled 



-16- 



01 1 8771 



by a factor of four as represented by block 38. The upsanrpled digital 
signals then undergo the Q'SD signal processing as represented by block 
39. The binary bits output from the CVSD processing are assembled into 
binary'Vords which in number are compressed by a factor of 4:1 when 
5 compared to the original digital signals generated at block 35, and these 
compressed signal are combined with the value of the pitch period on path 
40 to be stored in a storage or memory 34, 

Figure 3 shows the functional steps performed when a digitized and 

10 compressed voice signal, done so in accordance with the teaching of our 
invention, is read out of memory 34 and is reconverted to an audio 
signal. The digitized and conpressed data read out of storage memory 34 
includes the binary number representing the pitch period which is used in 
the TEHE processing represented at block 43. The compressed digitized 

15 voice first undergoes inverse CVSD processing to remove the effects of the 
compression caused by this form of signal processing. This inverse 
processing is }<jiovn in the art and is described in the previously cited 
references. The expanded digitized voice signal output from block 41 is 
then downs ampled by a factor of four at block 42 to remove the binary 

20 nunbers added to the digitized information by interpolation at block 38 in 
Figure 2. The value of pitch period on path 44 and the downsampled 
digitized voice signals then undergo TDHE processing at block 43 to derive 
the original binary signals created by digitizing the voice signal. When 
the operator of our novel equipment decides to change the playback speed 

25 they apply an input to the equipment to do so. V.hen it is desired to 

double the speed of playback the downsampled digitized signals output from 
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block 42 bypass block 43 and do not undergo TDHE processing, but rather go 
directly to the digital to analog conversion step. When it is desired to 
slow the playback speed the downsainpled digitized signals undergo modified 
proces^siTig to expand the number of binary words. Processor 14 is then 
5 utilized to remove the effects of linearization. Finally, the signal 
output from block 45 is in U-255 LAW PCM format which is converted from 
digital form to a voice at block 46. This playback operation briefly 
described by functions referenced to in Figure 3 is now described in more 
detail with reference to Figure 1. 

10 

In the playback mode digitized and compressed voice signals, done so 
in accordance with the teaching of our invention, are either read out of 
memory 34 in processor 7 or are obtained frora an external memory or 
teleccHimjnications channel [both not shown] via input 26. The binary 

15 words making up these digitized and compressed signals are individually 
passed via bus 27 of host processor 7, EMA. interface 30, and buss 19 to 
first buffer memory 12 from which the words are forwarded to processor 14 
for expansion and de linearization and then forwarded to serial /parallel 
converter 31 which now functions as a parallel to serial converter. In 

20 this manner the individual bits of each expressed data word is taken one 
at a time and applied to microprocessor 14. Inverse CV5D processing is 
performed, in a manner known in the art, and the expanded number of binary 
nunbers resulting therefrom are then downsampled to remove three of every 
four binary words, and more specifically those binary words created during 

25 upsampling by interpolation. The binary words output from the 

downsampling step then either bypass the TEHE processing step or undergo 
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the TEHE processing to double the playback speed, playback at normal 
speed, or playback at a reduced speed. As the digitized voice signal is 
decompressed through the steps just described the expanded number of 
binary- words then undergo delinearization to u-255 LAK POl and is then 

5 forwarded directly to D/A converter 11 to reconvert the binary words to 
the original voice signal at analog output 33. For details of how to 
implement a twelve bit to eight bit conversion when reconverting 
compressed digitized voice signals back to voice see "A New Digital 
Technique For Lnplementation of any Continuous PCM Companding Law", by M. 

10 Villeret, P. A. Deschenes and H. Stephenne of the Electrical Engineering 
Department of the University of Sherbrooke, Sherbrooke, Quebec, Canada; 
Conference Record, 1973 IEEE International Conference on Coironunications . 
Volune 1, pages 11-12 to 11-17, 

15 V^hile what has been described hereinabove is the preferred embodiment 

of the invention it will be obvious to those skilled in the art that 
numerous changes may be made without departing from the spirit and scope 
of the invention. For example, if the analog -to -digital conversion 
sampling is performed at thirty two thousand samples per second, 1:4 

20 upsampling is not required before CVSD signal processing. In addition, 
other types of signal processing may be utilized. Instead of TDHS 
processing another process that derives the pitch of the voice signal and 
compresses the spectrum while reducing the number of binary words 
representing the voice signal may be utilized. Similarly, instead of CVSD 

25 processing another delta modulation process that encodes binary words 
representing sample points as a binary bit representing the slope of the 
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voice signal may be used. In addition, other wavefom coding techniques 
may be utilized to encode the binary words. Also, TDHS and /or OJSD or 
alternative processing may be undergone more than once in compression and 
expansion. In addition, for playback speed control binary words may be 
5 input to the appropriate signal processing step to affect the degree of 
binary word expansion. Further, one skilled in the art could detect quiet 
periods or silence and not digitize these, but further compact the speech 
infonnation. In addition, the backing of our invention may be utilized in 
modems, dual tone multi frequency generators and receivers, voice 
10 synthesis and recognition, and other signal processing applications due to 
its programmable nature - 
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What we claim is: 



1. A method for compressing a first number of birxary words representing a 
continuously varying signal over a given period of time where said varying 
signal has been sampled at a predetermined rate per unit time and the samples 
digitized to said first number of tinary words, with the varying signal having 
a pitch value, and comprising the steps of: 

extracting the pitch value of said varying signal from said first number 
of binary words and indicating the pitch value by a binary number, 

reducing said first number of binary words using a first signal processing 
technique to a second number of binary words, and 

processing said second number of binary words using a second signal 
processing technique that processes binary words representing sample 
points of said varying signal to encode each of said second number of 
binary words as a binary bit representing the slope of said varying - 
signal, to thereby reduce said second number of binary words to a number 
of binary bits. 

2. The method in accordance with claim 1 further comprising the step of: 

assembling the bits provided per said second signal processing technique 
into multi bit binary words, said multi bit binary words and said binary 
word representing said pitch value representing said continuously varying 
signal and being compressed substantially in the order of four to one in 
comparison to said first number of binary words representing said varying 
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3. The method in accordance with claim 2 further comprising the step of 
increasing said second number of binary words resulting from said first signal 
processing technique by interpolating between each of said second number of 
binary words- because said second number of binary words are insufficient for 
proper processing using said processing step. 

4. A method for digitizing a continuously variable signal and reducing the 
number of binary words obtained from the digitization comprising the steps of: 

digitizing said vaiying signal to obtain a first number of binary words 
representing said varying signal over a given period of time, 

extracting the pitch value of said varying signal frcm said first niiaber 
of binary words and indicating same by a binary number, 

reducing said first number of binary words using a first signal processing 
technique to a second number of binary words, and 

processing said second nmber of binary words using a second signal 
processing technique that processes binary words representing sample 
points of a varying signal to encode each of said second number of binary 
words to reduce each of said second number of binary words to a smaller 
binary representation. 

5. The method in accordance with claim 4 wherein said digitizing step 
ccnprises the steps of: 



sampling said varying signal at a plurality of points and generating a 
binary number for each sample; and 
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processing each binary number from the sampling step and changing each 
binary number as necessar>' to correct for non linearities inherent in the 
sampling step, and thereby generate said first number of binary words. 

6. The method in accordance with claim 5 further comprising the step of 
assembling the smaller binary representations provided per said second signal 
processing technique into larger binary words, said -larger binary words and 
said binar>' word representing said pitch value representing said continuously 
varying signal but being compressed substantially in the order of four to one 
in comparison to said first number of binary words representing said varying 
signal. 

7. The method in accordance with claim 5 further comprising the step of: 

increasing said second number of binary words resulting from said reducing 
step by interpolating between pairs of said second number of binary words 
because said second nunber of binary words are insufficient for proper 
processing using said processing step. 

8. The method in accordance with claim 6 further comprising the step of 
assembly said first binary numbers obtained iri digitizing said continuously 
varying signal into blocks of binary numbers upon which said pitch extracting 
and reducing steps are performed. 

9. A method for accomplishing playback speed control in a system that 
digitizes a continuously varying signal into a first number of binary words 
and then utilizes a signal processing technique that utilizes said first 
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niinber of binary words to derive the pitch of the varying signal and 
represents it as a binary word .gnd also reduces the first number. .gf binary 
words to a second number of binary words also representing said varying 
signal, and_ playback of said varying signal .is accotnplished by processing said 
second nunber of binary words and said pitch binary nunber in an inverse 
manner using said signal processing technique to derive said first number of 
binar/ words which are then reconverted to said varying signal by digital to 
analog conversion, comprising the step of: 

modif>'ing said signal processing technique for playback to expand each 
second binary word into a plurality of binary words different than said 
first nunber of binary words and when said plurality of binary words are 
converted into said continuously varying signal the speed of playback of 
said var>'ing signal is different than the speed of the origonal ^varying 
signal that was digitized. 

10. Tne method in accordance with claim 9 further COTiprising the step of 
decreasing said second nunber of binary nunber s to a reduced niHiiber of binary 
words by deleting periodic ones of same, said reduced number of binary words 
then being processed by said signal processing technique in an inverse raanner. 

11. A method for accomplishing playback speed control in a system that 
digitizes a continuously varying signal into a first number of binary words 
and then utilizes a signal processing technique that utilizes said first 
nunber of binary words to derive the pitch of the varying signal and 
represents it as a binary word and also reduces the first number of binary 
words to a second nunber of binary words also representing said varying 
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signal, and playback of said varying signal is accomplished by processing said 
second nimher of Mnary words and said pitch binary number in an inverse 
manner using said signal processing technique to derive said first number of 
binary words_ which are then reconverted to said varying signal by digital to 
analog conversion, comprising the step of; 

applying said second number of binary words directly to said digital to 
analog conversion to increase the playback speed which is doubled when 
said second number of binary words is half the first number of binary 
words. 

12. The method in accordance with claim 11 wherein said continuously varying 
signal is played back at the sme speed it was originally when digitized when 
said second number of binary words is converted to said first number of binary 
words by inverse processing using said signal processing technique and further 
comprising the step of: 



15. The method in accordance with claim 12 further coTrrprising the step of 
decreasing said second number of binary numbers to a reduced number of binary 
words by deleting periodic ones of same, said reduced number of binary words 
then being processed by said signal processing technique in an inverse manner. 
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